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The  reliability  of  sleep  measures  was  calculated  over  two  nights  (and  within  the  nights 
for  twenty  young  adult  males.  Percent  time  in  stages  1,  2,  3,  and  4,  percent  movement 
time,  number  of  movements,  and  number  of  stage  changes  were  significantly  correlated 
between  subjects  over  nights.  The  percent  REM  time  and  REM  cycle  duration  were  not 
significantly  correlated  over  nights.  Within  subjects,  the  length  of  the  REM  period 
had  a  significant  negative  correlation  with  the  length  of  the  preceding  NREM  period 
but  not  with  the  following  NREM  period.  Those  data  raise  questions  as  to  the  use  of 
the  standard  sleep  measures  as  a  reliable  human  traits  in  young  male  adults. 
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ABSTRACT 

The  reliability  of  sleep  measures  Kas  calculated  over  two  nights  (and  within  the 
nights)  for  20  young  adult  males.  Percent  time  in  stages  1,  2,  3,  and  4,  percent  move¬ 
ment  time,  number  of  movements,  and  number  of  stage  changes  were  significantly 
correlated  between  Ss  over  nights.  The  percent  REM  time  and  REM  cycle  duration  wee 
not  significantly  correlated  over  nights.  Within  5s,  the  length  cf  the  REM  period  had 
a  significant  negative  correlation  with  the  length  of  the  preceding  NREM  period  but 
not  with  the  following  NREM  period.  These  data  raise  questions  as  to  the  use  of  the 
standard  sleep  measures  as  reliable  human  traits  in  young  male  adults. 
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It  is  a  common  practice  in  sleep  research 
to  discard  the  first  night  in  tlie  laboratory 
because  of  tlie  first  night  effects  (Agnew, 
Webb,  k  Williams,  1966).  The  general  as¬ 
sumption  is  then  made  that  sleep  measures 
for  subsequent  nights  are  consistent  from 
night  to  night.  While  many  studies  have 
been  concerned  with  the  consistency  and 
predictability  of  sleep  characteristics  (Wil¬ 
liams,  Agnew,  &  Webb,  1964,  1966;  Hart¬ 
mann,  1968;  Weitzman,  Kripke,  Gold- 
macher,  McGregor,  &  Nogeire,  1970),  and 
report  similar  group  means  over  several 
nights,  there  are  only  two  published  studies 
that  report  reliability  data  for  individual 
subjects  over  nights.  Webb  and  Agnew 
(1969)  computed  correlations  for  sleep 
stages  over  3  nights  for  nonclinical  subjects 
whose  ages  ranged  from  21  months  to  59 
years.  Johnson,  Burdick,  and  Smith  (1970) 
have  reported  correlations  of  sleep  charac- 
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teristics  over  12  nights  in  chronic  alcoholics 
during  withdrawal.  In  both  reports,  NREM 
sleep  was  found  to  be  consistently  more  re¬ 
liable  than  REM  sleep. 

This  paper  reports  the  analysis  if  sleep 
data  from  2  nights  of  uninterrupted  sleep  in 
a  group  of  20  healthy  young  adults.  The 
purpose  was  to  determine  if  there  are  meas¬ 
ures  of  sleep  which  are  reliable  and  can  be 
used  as  predictors  of  other  measures  of 
sleep,  both  within  subjects  and  between 
subjects. 

Method 

Twenty  male  naval  recruits,  aged  17  to  21, 
lived  in  our  sleep  laboratory  for  11  days. 
They  were  studied  one  at  a  time  during  a 
baseline  period  of  4  nights,  and  during  var¬ 
ious  experimental  periods  of  sleep  depriva¬ 
tion  and  recovery.  Each  5  rejiorted  in  on 
Monday  and  slept  that  night  in  the  sleep 
laboratory,  attached  to  all  standaid  elec¬ 
trodes  and  transducers.  These  electrodes 
and  transducers  were  not  coupled  to  our  re¬ 
cording  equipment  and  no  actual  recordings 
were  run.  On  the  3  subsequent  nights,  Tues¬ 
day,  Wednesday,  and  Thursday,  the  5  was 
recorded.  Data  from  the  third  and  fourth 
baseline  nights  (nights  3  and  4)  were  used 
in  this  report. 

5s  slept  in  a  sound-treated  electrostati- 
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cally  sliieldccl  room.  Twelve  channels  of  re¬ 
cordings  were  made  on  an  OiTner-Seckman 
Type  11  Dynograpli.  Left  and  right  EOGs 
'vere  recorded  from  the  outer  canthi  refer¬ 
enced  to  the  opposite  mastoids  (A1  and 
A2).  EMG  was  recorded  from  tlie  chin. 
EEG  was  recorded  from  the  left  central 
(C3),  right  central  (C4),  and  left  occipital 
portions  of  the  scalp  (01)  referenced  to  the 
opposite  masioids.  Skin  potential  and  skin 
resistance  were  recorded  using  O’Conaell- 
Tursky  electrodes  on  the  right  hand  and 
forearm.  Heart  rate  u’as  recorded  with  a 
Beckman  model  9857  cardiotachometer. 
Finger  pulse  amplitude  and  respiration  were 
recorded  using  mercury  strain  gauges.  Re¬ 
cordings  were  run  at  a  speed  of  10  mm  per 
second. 

/Ss  were  told  “go  to  sleep”  at  about  2230 
and  were  awakened  at  about  0600,  to  pro¬ 
vide  tiiem  with  about  7  to  71/0  hours  of 
sleep.  The  range  was  from  412  to  458  iitin- 
utes,  e.vcept  for  2  Ss  who  only  had  390  min 
due  to  apparatus  dilliculties  at  the  begin¬ 
ning  of  the  night. 

All  records  were  scored  for  sleep  stages  in 
30-sec  epochs  according  to  the  criteria  in 
the  Manual  of  the  Association  for  the  Psy- 
chophj-siological  Study  of  Sleep  (Recht- 
schafl'en  &  Kales,  19G8).  The  percent 
agreement  for  ejtoch  by  oi)och  scoring  in 
this  laboratory  varies  from  85  to  94  |)crcent. 
All  the  records  were  scored  by  the  same 
|)erson,  whose  scoring  reliability  is  consist¬ 
ently  near  90  percent.  Any  set  of  succc.^sive 
30-scc  eitochs  scored  as  stage  REM  was 
called  a  “REM  episode.”  When  REM  epi¬ 
sodes  were  separated  by  less  than  15  min, 
the  entire  time  interval  from  the  onset  of 
the  first  REM  episode  to  the  offset  of  the 
last  REM  episode  was  called  a  “REM  jte- 
riod.”  This  agrees  with  the  criteria  used  by 
previous  investigators  fe.g.,  Hartmann, 
1908;  Globus,  1970). 

Results 

The  average  total  sleep  time  on  night  3 
was  42S  min,  with  a  range  of  371  to  403 
min;  the  average  on  night  4  was  435  min, 
with  a  range  of  390  to  458  min.  Averages  of 
various  sleep  measvires,  as  well  as  the  aver¬ 
age  percentage  of  time  spent  in  each  sleep 
stage,  are  shown  in  Table  1.  A  zero-mu  t 
test  for  correlated  means  on  the  14  meas- 


TABLE  1 

Means,  standard  deviations,  and  between-night 
correlations  of  sleep  7neasurcs<‘ 


Sleep 

Pleasures 

Statistic 

xisht  3 

Xi£ht4  j 

(Si  X  NO 

%Wako 

X 

2.005 

3.030 

.301 

s 

1.435 

2.573 

%Stage  1 

X 

4.265 

5.030 

.540** 

s 

2.233 

2.140 

%l{EiM 

X 

25.515 

23.970 

.191 

s 

0.002 

4.757 

%Stagc  2 

X 

50.500 

51.055 

.440* 

s 

6.096 

0.530 

%Stage  3 

X 

7.125 

0.650 

.380* 

s 

3.007 

2.973 

%Stage  4 

X 

12.510 

13.295 

.400* 

s 

5.507 

4.977 

%MT 

X 

2.325 

2.120 

.451* 

s 

1.527 

1.192 

’l'.ST 

X 

428.800 

435.900 

.127 

s 

24.098 

19.087  i 

WBl 

X 

7.075 

8.350 

.198 

s 

5.751 

7.692 

1 

WAl 

X 

2.225 

5.550 

-.044 

s 

2.907 

10.050 

TB2 

X 

14.050 

10.393 

.258 

s 

9.434 

18.875 

1 

1 

TUB 

X 

81.025 

92.875 

J  .125 

s 

23.852 

25.522 

1 

KuMT 

X 

13.200 

14.900 

'  .020** 

s 

7.951 

7.091 

NuSC 

X 

i  04.100 

65.700 

1  .558** 

1 

24.859 

24.802 

“  Abbi-cviatioiis  'fable  1  and  'fable  2:  M'f, 
movement,  time;  'fS'f,  total  sleep  time;  WBl, 
wake  time  before  stage  1;  WAl,  wake  time  after 
stage  1;  'fB2,  time  to  first  stage  2  onset;  'fBll, 
time  to  fir.st  U  EM  onset ;  XuM'f ,  number  of  move¬ 
ments;  NuSC,  number  of  stage  changes. 

*  p  <  .05,  one-tail  tost. 

**  p  <  .01,  one-tail  test. 

tiros  revealed  no  significant  differences  be¬ 
tween  night  3  and  night  4.  The  Pearson 
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TABLE  2 

Within  -nighl  correlations  of  sleep  ineasnres 


Sleep  .Measures 

Correlations 

Night  3 

>hight  4 

%UEM  X  %Stage  2 

-.30-1 

-.120 

%1{EM  X  %Stages  3  4 

-  .484* 

-.015 

%BEM  X  %MT 

.147 

.144 

%Stage  2  X  %Stagc.s  3  -f-  4 

-.274 

-  .004** 

%Stage  3  X  %Stnge  4 

-.120 

-  .395* 

TB2  X  TMT 

.000 

-.110 

%MT  X  %Stage  1 

.344 

—  .040 

NuMT  X  NuSC 

.717** 

.591** 

%Wake  X  %MT 

.113 

.283 

%Wake  X  %Stage  1 

.130 

.105 

*p  <  .05,  two-tail  test. 

**  p  <  .01. 


product-moment  correlations  for  the  2 
nights  between  Ss  are  presented  in  the  last 
column  of  Table  1.  Significant  positive  cor¬ 
relation  coefficients  were  found  for  percent 
sleejj  in  stages  1,  2,  3,  and  4,  percent  move¬ 
ment  lime,  number  of  movements,  and  the 
number  of  stage  changes.  (Correlations 
were  also  calculated  for  time  in  min  but 
gave  identical  results.)  Rank-order  rhos 
differed  from  the  product-moment  correla¬ 
tions  by  ±.14,  indicating  that  deviations 
from  bivariate  normality  were  insignificant. 

Within-night  between-subjects  Pearson 
product-moment  correlations  for  10  pairs  of 
sleej)  measures  are  given  in  Table  2.  The 
only  correlation  that  was  significant  for 
both  nights  was  that  between  the  number  of 
movements  and  the  number  of  stage 
changes. 

REM’lSonREM  Cycle  Duralion 

The  duration  of  the  first  REM  cycle  each 
night  was  measured  in  minutes  from  the 
onset  of  the  first  REM  period  to  the  onset 
of  the  second  REM  period.  The  duration  of 
the  second  REM  cycle  of  the  night  was 
measured  from  the  onset  of  the  second 
REM  period  to  the  onset  of  the  third  REM 
period.  All  20  Ss  had  at  least  3  REM  peri¬ 
ods,  but  a  few  lacked  a  4th  and  5th  REM 
period.  The  length  of  the  REM  period 
about  doubled,  from  an  average  of  16  min 
for  the  first  REM  period  to  an  average  of 
35  min  for  the  fourth  REM  period.  The 
duration  of  the  first  REM  cycle  was  the 


shortest  of  the  night,  but  there  was  no  in¬ 
crease  in  REM  cycle  duration  after  the  sec¬ 
ond  REM  cycle.  In  Table  3  are  the  dura¬ 
tions  of  each  S’s  first  and  second  REM  cy¬ 
cles  for  nights  3  and  4. 

There  are  six  possible  correlations  among 
the  four  REM  cycles.  Only  one  was  signifi¬ 
cant:  the  correlation  between  nights  3  and  4 
for  the  first  cycle  is  .42,  significant  at  about 
the  .03  level  (one-tail  test) .  The  other  cor¬ 
relations  ranged  from  —.14  to  .28,  none 
reaching  the  .05  significance  level  (one-tail 
test).  All  correlations  were  recomputed 
using  the  Spearman  rank-order  method — 
but  with  the  same  result.  Only  the  correla¬ 
tion  for  the  first  cycle  was  significant. 

The  usual  measure  of  REM  cycle  dura¬ 
tion  is  tl'.e  average  of  all  available  cycle 
durations  during  a  night;  ordinarily,  the 
average  of  several  similar  measures  has  a 
higher  reliability  th.an  the  uncombined 
scores.  An  average  night  3  score  was  com¬ 
puted  using  the  two  to  four  cycle  duration 

TABLE  3 


Duralion  of  REM  cycle  m  7ninutcs  for  two  cycles 
and  two  nights 


Sublect 

REM  Durations  (min) 

First  REM  Cycle 

Second  REM  Cycle 

Night  I!2 

Night  1)3 

Night  B2 

Night  B3 

02S08 

80 

87  ' 

70 

70 

03UG8 

134 

120 

102 

90 

04008 

70 

91 

no 

94 

05SC8 

109 

108 

90 

118 

00R08 

82 

71 

94 

120 

07C68 

77 

04 

92 

80 

08R08 

80 

94 

158 

90 

09S08 

84 

104 

00 

07 

10S08 

83 

112 

111 

82 

11R08 

58 

50 

90 

100 

12C08 

80 

80 

00 

112 

13S08 

113 

92 

107 

120 

01R70 

70 

105 

97 

84 

02S70 

107 

83 

70 

88 

03R70 

74 

81 

107 

150 

04R70 

74 

02 

99 

84 

05S70 

70 

122 

no 

100 

OOS70 

94 

72 

124 

102 

07R70 

80 

98 

100 

72 

08R70 

i 

80 

02 

no 

89 

Mean 

85.0 

88.5 

102.8 

90.5 

Stand.  Dev. 

17.8 

19.8 

20.2 

20.7 
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scores  available  for  each  5.  An  average 
nighu  4  score  was  calculated  in  the  same 
\va} .  The  night  3  score  had  a  mean  of  92.1 
min  and  a  standard  deviation  of  17.3.  The 
nigiil  4  score  had  a  mean  of  92.8  and  a 
standard  deviation  of  11.7.  The  correlation 
between  nights  3  and  4  was  .22,  far  below 
the  .0?)  level  of  significance. 

A  within-subject  within-night  analysis  of 
the  REM  cycle  was  done  by  comparing  the 
duration  of  each  REM  period  with  the  dur¬ 
ation  of  the  previous  NREM  period.  Hold¬ 
ing  night  and  subject  constant,  this  method 
yielded  one  correlation  per  subject.  The 
mean  of  these  correlations  was  —.206  for 
night  3  and  —.402  for  night  4.  Both  were 
significant  at  the  .05  level  by  the  signed- 
ranks  test  and  the  two-tailed  zero-mu  t  test. 
An  identical  analysis  was  performed  using 
the  REM  period  paired  with  the  following 
NREM  period  The  mean  of  the  correla¬ 
tions  was  .046  for  night  3  and  —.176  for 
night  4.  Neither  was  significantly  different 
from  zero  by  the  t  test  or  signed  ranks  test. 
Recall  that  during  the  night,  the  duration 
of  the  REM  cycles  tends  to  remain  fairly 
constant.  Since  the  REM  period  duration 
about  doubles  ir  the  last  third  of  the  night, 
the  NREM  segment  preceding  the  REM 
period  must  decrease;  thus  the  negative  cor¬ 
relation.  But  the  length  of  that  REM  period 
cannot  be  used  as  an  index  of  the  length  of 
the  immediately  following  NREM  period. 

Discussion 

Since  Dement  and  Kleitman  (1957>  pub¬ 
lished  their  paper  on  cyclic  variations  dur¬ 
ing  sleep  and  detailed  a  quantitative  way  of 
scoring  sleep,  these  scores  have  been  used  as 
independent  and  dependent  variables  with 
surprising  little  concern  over  the  reliability 
of  these  measures.  The  matter  of  constancy 
is  usually  handled  as  did  Berger  (1969)  in 
an  opening  chapter  in  a  .symposium  report 
on  sleep:  “The  i)attern  of  sleep  from  night 
to  night  in  a  single  individual  remains  rela¬ 
tively  constant,  except  for  the  first  night 
si)ent  in  the  lal)oratory  when  the  subject 
takes  a  longer  time  to  fall  asleep,  tends  to 
awaken  more  frequently,  and  has  less  REM 
sleep  than  on  subsequent  nights  [p.  21].” 
l^xcept  for  the  reports  by  Webb  and  Agnew 
(1969)  and  Johnson  et  al.  (1970),  there  ap- 
l)ear  to  be  no  published  data  to  support  this 


generally  used  and  accepted  statement.  Our 
sample  is  comparable  to  the  6-19-year-old 
group  studied  by  Webb  and  Agnew  (1969). 
Only  for  stages  1,  2,  and  4  were  their  retest 
correlations  over  3  nights  (.31,  .60,  and  .48 
respectively)  statistically  significant  (p  < 
.05,  one-tail).  The  correlations  for  wake 
and  REM  were  not  significant.  Our  correla¬ 
tion  for  stage  3  is  higher  than  that  found  by 
Webb  and  Agnew,  but  the  other  five  corre¬ 
lations  agree.  REM  sleep,  which  has  been 
the  focus  of  much  research,  had  much  lower 
coeflTicients  in  both  studies.  Webb  and 
Agnew  rep'orted  lower  coeffieients  for  R,EM 
sleep  than  NREM  sleep  at  all  age  levels, 
and  the  coefficient  was  not  significant  in  the 
30-39  age  group. 

Although  most  of  our  sleep  stage  retest 
correlations  were  significantly  greater  than 
zero,  they,  as  well  as  those  by  Webb  and 
Agnew,  were  far  below  the  .7  to  .8  needed 
for  reliable  predictions.  Authors  usually  re¬ 
port  the  comparison  of  mean  values  over 
nights  and  note  their  similarity.  Our  mean 
scores  also  showed  remarkable  stability  for 
every  characteristic  measured.  Our  correla¬ 
tions  across  subjects,  however,  indicate  that 
the  similar  group  values  are  very  mislead¬ 
ing  with  respect  to  night  to  night  intrasub¬ 
ject  consistency. 

The  consistent  significant  correlations  re¬ 
ported  in  Table  2  for  the  number  of  move¬ 
ments  and  the  number  of  stage  changes 
show  that  frequently  a  stage  change  is  ac¬ 
companied  by  a  movement  and  vice-versa. 
The  number  of  movements  and  the  number 
of  stage  changes  have  the  highest  correla¬ 
tions  in  Table  1.  The  relationship  of  body 
movements  to  stage  changes  has  been  pre¬ 
viously  noted  by  Sassin  and  Johnson 
(1968).  They  were  also  impressed  with  the 
consistency  of  the  number  of  body  move¬ 
ments  over  nights.  The  same  consistency 
was  noted  by  Johnson  et  al.  (1970).  Wil¬ 
liams  et  al.  (1964)  reported  stage  change 
scores  for  16  young  adults  over  throe  nights. 
Their  average  intercorrelation  is  almost 
identical  to  our  coefficient  of  .56.  The  aver¬ 
age  duration  of  the  NREM-REM  cycle  was 
constant  over  nights,  but  the  reliability  of 
the  REM  cycle  duration  score  was  quite 
low.  The  set  of  six  correlations  calculated 
for  the  four  duration  scores  does  not  differ 
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si^'iiificaiitly  from  tliafc  expected  by  cliancc. 
Our  sample  size,  20  5s,  was  rather  small, 
but  if  the  true  correlations  are  .5  or  higher, 
our  probability  of  detecting  a  positive  cor¬ 
relation,  significant  at  the  .05  level,  is  .70  or 
better. 

Could  the  reliability  of  REM  cycles  be 
improved  by  some  other  method  than  meas¬ 
uring  the  average  interval  from  REM  onset 
to  REM  onset?  Globus  (1970)  uses  an 
auto-agreement  method  which  takes  ac¬ 
count  of  the  REM-NREM  status  during 
every  scoring  period  of  the  night.  (He  ob¬ 
tained  an  average  REM  cycle  interval  of 
102  ±  9  min.)  On  the  basis  of  his  studies 
and  others  such  as  Weitzman  et  al.  (1970), 
Globus  proposed  that  relatively  permanent 
aspects  of  the  behavior  of  patients,  such  as 
psychotic  behavior  or  depression,  might  be 
correlated  with  REM  rhythniicity.  We  are 
currently  trying  to  fit  sinusoidal  and  boxcar 
(i.e..  binary)  periodic  functions  to  binary 
REM  scores  as  well  as  graded  measures  of 
EEG  delta  activity.  These  measures  of 
REM  cycle  duration  may  well  have 
greater  reliability  than  the  present  method 
which  relies  heavily  on  estimating  the  exact 
time  of  REM  onset.  Until  other  methods 
are  iierfected,  the  low  reliability  of  the  cur¬ 
rent  method  of  estimating  REM  cycle  dura¬ 
tion  makes  it  a  poor  measure  of  any  trait  of 
the  subject,  though  it  may  be  a  sensitive 
measure  of  the  state  of  the  subject.  Our 
data  further  suggest  that  more  attention 
should  also  be  given  to  the  reliability  of 
other  sleep  measures  before  they  arc  simi¬ 
larly  used  as  stable  individual  traits. 
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